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Abstract. We present an XMM-Newton mosaic observation of the hot (kT ~ 6.5 keV) and nearby (z = 0.0881) relaxed cluster 
of galaxies A478. We derive precise gas density, gas temperature, gas mass and total mass profiles up to 12' (about half of the 
virial radius /?20o)- The gas density profile is highly peaked towards the center and the surface brightness profile is well fitted 
by a sum of three j8-models. The derived gas density profile is in excellent agreement, both in shape and in normalization, with 
' the published Chandra density profile (measured within 5' of the center). Projection and PSF effects on the temperature profile 

1^ I determination are thoroughly investigated. The derived radial temperature structure is as expected for a cluster hosting a cooling 

core, with a strong negative gradient at the cluster center. The temperature rises from ~ 2 keV up to a plateau of ~ 6.5 keV 
beyond 2' (i.e. r > 208kpc = 0.1 R2oo> ^200 = 2.08 Mpc being the virial radius). From the temperature profile and the density 
profile and on the hypothesis of hydrostatic equilibrium, we derived the total mass profile of A478 down to 0.01 and up to 0.5 
times the virial radius. We tested different dark matter models against the observed mass profile. The Navarro, Frenk & White 
<1997t model is significantly preferred to other models. It leads to a total mass of M200 = 1.1 x 10'^ Mq for a concentration 
, parameter of c = 4.2 ± 0.4. The gas mass fraction increases slightly with radius. The gas mass fraction at a density contrast of 

S = 2500 is /gas = 0.13 ± 0.02, consistent with previous results on similar hot and massive clusters. We confirm the excess of 
absorption in the direction of A478. The derived absorbing column density exceeds the 21 cm measurement by a factor of ~ 2, 
this excess extending well beyond the cool core region. Through the study of this absorbing component and a cross correlation 
^ ' with infrared data, we argue that the absorption excess is of Galactic origin, rather than intrinsic to the cluster. 

C/3 . Key words, galaxies: clusters: individual: A478 - Intergalacic medium - Cosmology: observations, dark matter. X-rays: galax- 

ies: clusters 
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1 . Introduction The observed clusters seem to have a cusped dark matter pro- 

^ file as predicted by numerical simulations (NavaiTO, Frenk & 

. y . As nodes of large scale structure and thus places of dark matter ^^^^ ^p^. ^^^^.^ 

concentration, galaxy clusters can be used as powerful tools mQGSL). However, the central slope of the dark matter pro- 

to test theories of structure formation. The basic hierarchical p^^^.^j^ dispersion of the concentration parameter 

scenarios based on gravitation make the population of galaxy ^.^^^.^ ^^^^ ^^^^^^ Larger samples of high quality mass pro- 

clusters a homologous population of sources. Their physical r-, a a ^ f »u 

° ^ ^ ^ ^ nles are needed to further assess these points. 

properties follow scaling laws depending only on their redshift 
and mass, and their internal structures are similar. 

The exceptional capabilities of XMM-Newton in terms of I" this paper we present the XMM-Newton spectro-imaging 

sensitivity and of Chandra in term of spatial resolution allow observation of A478, a massive, relaxed nearby cluster (z = 

us to characterize the gas density and temperature profiles with 088 1 - Struble & Rood 199 9 1. Detected in surveys (UHURU, 

unprecedented accuracy. For a relaxed cluster, the hydrostatic HEAO-1, Ariel- V), this cluster is well known in X-rays and 

equations can be used to derive the underlying dark matter dis- Physical properties have been carefully studied with pre- 

tribution, from the very central part of clusters up to nearly the ^lo^^s X-ray observatories: EXOS AT (Edge & Stewart mjj, 

virial radius (David et al. 2001 Allen et al. 2001b Arabadjis, Einstein and Ginga (Johnstone et al. [D ^, ROSAT (AUeii 

Bautz & Garmire 2002 Allen, Schmidt & Fabian 2002 Pratt & al. White et al. 'TO'l and ASCA (Markevitchl^^i 

Arnaud .2002.2003. Lewis et al. 2003, Buote & Lewis . 2003. . White et al. 2000). All those previous studies converge for what 

concerns the overall temperature of the cluster, kT ~ 6.8 keV. 

Send ojfinint requests to: E. Pointecouteau, e-mail: Recently, Sun et al. f2003't performed a high angular resolu- 

pointecoSdiscovery . saclay . cea . fr tion study of the central part of the cluster with Chandra. They 
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pointed out the presence of an X-ray cavity in the very central 
part of the cluster which is anti-correlated with the radio lobes. 

Here we focus on the characterization of the gas and dark 
matter distribution of A478. In a companion paper de Plaa et 
al. (in prep.) present a detailed spectroscopic study of the metal 
abundances and their distribution within A478's core based on 
EPIC and RGS data. We present the observation and the differ- 
ent data processing steps in Sect. |2 In Sect. |3we briefly dis- 
cuss the cluster morphology. In Sect. |3 we analyze the surface 
brightness profile and derive the gas density profile. Spatially 
resolved spectroscopic analysis is presented in Sect. |5] where 
we also discuss the temperature and absorption profiles. In 
Sect. |6l we present the resulting total mass and gas mass frac- 
tion profiles of A478 and we discuss the shape of the dark mat- 
ter profile according to our observational results. 

Throughout this paper, we use Hq - 70 km Mpc"' s"', 
Q,„ = 0.3 and Qa - 0.7. In such a cosmological framework, at 
the cluster redshift (z = 0.0881) 1' = 99 kpc. 

2. XMM-Newton observations and data processing 

2.1. Observations 

A478 was observed with XMM-Newton during revolution 401 
for a total exposure time of 126 ks with the EMOS camera 
and 122 ks with the EPN camera. For the first 70 ks two CCDs 
(CCD#4 and #7) of the EMOS 1 camera were not operating. An 
offset observation, centered at a =04''12'«35^ 6 =10°15'45", 
17' South West of the cluster center, was performed in revolu- 
tion 401 for 38 ks (EMOS camera) and 34 ks (EPN camera). All 
observations were performed in EXTENDED FULL FRAME 
mode and using the THINl filter We used the calibrated events 
list produced by the XMM-Newton SOC pipeline and processed 
them with the SAS (v5.4.1). We also used the blank-sky event 
files accumulated by Lumb et al. (ii2002i) . 

2.2. Event list screening 

In this work, we only kept events with pattern to 12 from 
EMOS data, pattern from EPN data and flag equal for both 
detectors. 

First the event list for each camera and each observation 
was filtered for periods of high background due to soft proton 
flares. Visual inspection of the light curve in a source-free en- 
ergy band ([10 - 12] keV for EMOS and [12 - 14] keV for 
EPN) revealed long periods of high background. These were 
excluded. We then fine-tuned the flare cleaning by using a "3cr 
clipping" selection of good time intervals as described in Pratt 
& Arnaud (I2003t . The Poisson fit of the Ught curve histogram 
provides a 3cr threshold above which corresponding frames are 
discarded. To excise all flaring periods, this clipping method 
was applied in diff'erent energy bands: [2 - 5] keV, [5 - 8] keV 
[8 - 10] keV and [10 - 12] keV for EMOS; [5 - 8] keV, 
[8 - 10] keV, [10 - 12] keV and [12 - 14] keV for EPN. This 
quite conservative choice was made to avoid any low energy 
flares that are present in some observations (Pratt & Arnaud 
[2002^ . 



The remaining exposure times after cleaning are 48.0 ks, 
40.9 ks and 37.3 ks for the central pointing and the EMOSl, 
EMOS2 and EPN camera respectively. The corresponding 
times for the off'set pointing are 13.1 ks for the EMOS cam- 
era and 1 1 .0 ks for the EPN camera. 

2.3. Extraction of imaging data and spectra 

For each camera, we merged the cleaned event lists of the two 
observations into a single event list. We only selected events 
falling within a 15' circular region centered on the detector op- 
tical axis. The sky pixel coordinates of each event of the offset 
pointing were re-projected onto the sky pixel grid of the central 
pointing. 

Scientific products (spectra, images, surface brightness pro- 
files) can thus be extracted in a single step from the merged 
events list, simplifying the analysis. However, the exposure 
time can vary strongly in a given extraction region and this 
has to be taken into account in estimating count rates. The ex- 
traction method we used is fully described in AppendixlAl All 
the products are corrected for vignetting in this extraction pro- 
cess. The vignetting correction is based on the photon weight- 
ing method described in Arnaud et al. (2001 ), the weight co- 
efficients being tabulated in the event list with the SAS task 

EVIG WEIGHT. 

2.4. Bacl<ground subtraction 

For each camera, a clean background event list was extracted 
from the corresponding blank-field data using the same filter- 
ing criteria (pattern and flag selection, 3cr flare clipping) as for 
the observations. Cast-background files were then generated 
for each pointing by applying the aspect solution of the ob- 
servation to the background dataset. The correct exposure time 
was computed for each blank field event list. 

To estimate the difference of particle background level be- 
tween each observation and the corresponding blank field data, 
we computed the ratio of the total count rates in the high en- 
ergy band ([10 - 12] keV and [12 - 14] keV respectively for 
EMOS and EPN data). As the expected average temperature of 
this bright cluster is quite high (~ 6.8 keV) we excluded a cir- 
cular central region (0 < 6') to avoid any contamination due to 
the cluster. 

Blank field products have to be normalized by this ratio, 
when used as a background for the corresponding observation 
products. The normalizations are slightly different from one 
pointing to the other To take that normalization into account, 
we multiplied the weight coefficients in each cast background 
file by the corresponding normalization factor Vignetting- 
corrected blank field counts are then automatically 'normal- 
ized' . These cast background files of the two pointings are then 
merged as for the observations. Product extraction is then the 
same (see AppendixlAl. 

For the EPN data, we generated a list of out-of-time events 
(OoT hereafter) to be treated as an additional background com- 
ponent. An OoT event occurs when a photon is detected during 
the read-out process. The current observing mode (Extended 
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Fig. 1. (left panel) Total (EMOS+EPN) XMM-Newton mosaic image of A478 in the [0.3 - 7] keV energy band (left panel). The 
image is coiTected for vignetting and exposure, and background subtracted using blank field data. Right panel: hardness ratio 
map (ratio of the count rates in the [1.7-7] keV and in the [0.3 - 1.7] keV energy bands). The smoothed contours of the mosaic 
image have been overlaid. 



Full Frame) minimizes the effect of OoT to 2.3%. The OoT 
event list was processed similarly to the observation EPN event 
files. 

The background subtraction (for spectra and surface bright- 
ness profiles) is performed as described in full detail in Arnaud 
et al. J2002t . It is a two-step procedure, which insures a correct 
cosmic X-ray background (CXRB) subtraction, even when the 
local CRXB is different from the average CRXB in blank field 
data. In a first step, for each product extracted from the merged 
observation event list, an equivalent product is extracted from 
the corresponding merged blank-field file and subtracted from 
it. For EPN the OoT data are also subtracted. This first step 
allows us to remove the particle background. However, it may 
over(under) subtract the CRXB if the CRXB in the observation 
region is smaller(larger) than the average value in the blank 
field observation. The residual CXRB (i.e. the difference be- 
tween the CXRB in the A478 field and in the blank field) is 
then estimated by using blank field subtracted data in the region 
free of cluster emission (9 > 16' from the cluster center - The 
cluster is significantly detected in the background subtracted 
surface brightness profile up to 12-13'). This residual CRXB is 
subtracted in a second step from each EMOS and EPN product. 
In our case the residual is negative in the [0.3 - 3] keV energy 
band. The residual count rate summed over the three detectors 
is -0.68 counts/s/arcmin^, which represents 23% of the total 
background count rate in this energy band. Beyond 3 keV the 
residual background (r > 16') spectrum is consistent with zero. 
Therefore the double subtraction beyond 3 keV will only con- 
tribute increasing the noise level in each channel. To minimize 



errors, the double subtraction is thus applied only to data in 
the energy band [0.3 - 3] keV. We nevertheless check (on the 
global spectrum) that the best fit values remain the same if a 
full double subtraction is applied. 

2.5. Point source exclusion 

Starting from the output of the SAS detection source task, we 
made a visual selection on a wide energy band EMOS & EPN 
image (extracted from the merged event lists) of point sources 
in the FOV. Events from these regions were excluded directly 
from each merged event list. We generated corresponding mask 
mosaic images, which were then used to compute the surface 
of each extraction region. 



3. Cluster morphology 

The mosaic countrate image (EMOS+EPN) in the [0.3-7] keV 
energy band is presented in Fig. ^ together with the hard- 
ness ratio map computed from the ratio of the images in the 
[1.7-7] keV and [0.3 - 1.7] keV energy bands. The images in 
various energy bands are vignetting coiTected and background 
subtracted using the corresponding blank field image and the 
OoT image in case of EPN. To generate the hardness ratio map, 
the hard band image was first adaptively smoothed (with the 
task asmooth). The soft band image was then smoothed using 
the same smoothing template as was created for the hard band 
image. The cluster morphology is regular and the hardness ra- 
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Fig. 2. A478 XMM-Newton (EMOS+EPN) surface brightness 
profile in the [0.3 - 2.0] keV band. The profile is background 
subtracted, corrected for vignetting and for the radial variations 
of the emissivity factor (see text). The best fit KBB gas density 
model (Eq.Q, fitted over the > 0.33' region is over-plotted as 
a solid line. The dotted line is the extrapolation of this model 
in the central region. Bottom panel: ratio between the data and 
the model. The dotted lines indicate the ±5% level. 

tio map does not exhibit any peculiar feature, reinforcing the 
assessment that A478 is a very relaxed cluster 

A478 has an elliptical shape. From optical and 
ROSAT/PSPC and HRI data White et al. (1994), derived 
an ellipticity e varying between 1.2 and 1.4 within the 
central (6 < 100") region. We fitted a 2D j6-model to the 
EMOS1+EMOS2 image in the [0.3 - 2] keV energy band, 
within the < 10' region. We derived a consistent value of the 
ellipticity, e - 1.22. The quality of the fit is poor, however: 
the residual image shows a strong excess at the cluster center 
position, as expected for a strong cooling flow cluster. 

Despite its slightly elliptical shape, in the remainder of this 
work we assume spherical symmetry and use circular annuli 
to extract the surface brightness profile and spectra. Pratt & 
Arnaud (2002i showed in the case of A1413, a cluster of a 
higher ellipticity of e = 1.4, that this has negligible impact 
on the derived temperature and mass profiles. 

4. Gas density radial profile 

4.1. Surface brightness profile 

We computed a background-subtracted vignetting-corrected ra- 
dial surface brightness profile for each detector in the [0.3 - 
2] keV energy band. The width of the radial bins was 3.3". The 
profiles for the three detectors were added into a single profile. 




Radius (arcmin) 

Fig. 3. Same as Fig.|2lwith the best fit BBB model (Eq.Q fitted 
over the full radial range. 



which was rebinned to reach a significance level of at least 3cr 
in each radial bin. The cluster emission is detected up to 13'. 

The emissivity in the considered energy band varies slightly 
with radius, due to the radial gradients in the hydrogen col- 
umn density along the line of sight (A^h), temperature (kr) and 
metallicity (Z) (See Sect. 15. 2> . The A^h, kr and Z values were 
extrapolated at each radius of the surface brightness profile by 
fitting the observed A^h, kr and Z profiles (See Sect. 15. 2> with 
a 3 degree polynomial, an empirical temperature profile as de- 
scribed in Allen et al. ( 2001b i and a lognorm law respectively. 
The corresponding emissivity profile (with errors) was esti- 
mated using an absorbed redshifted mekal model, convolved 
with the instrument response. Its radial variation is mainly 
dominated by the variations of A^h- The surface brightness pro- 
file was then divided by the emissivity profile normalized to its 
value at large radii. The errors on the emissivity were propa- 
gated to the corrected surface brightness profile. 

4.2. Gas density profile modeling 

The corrected surface brightness profile (presented in Fig. |2j 
is proportional to the emission measure profile, EM{r), and 
can be fitted directly using various parametric models of the 
gas density profile, ne('")- The corresponding emission mea- 
sure models were convoluted with the XMM-Newton PSF 
(Ghizzardi et al. 120011 12002> and binned in the same way as 
the observed profile. 

As expected, a standard y6-model provides an unacceptable 
fit, the data showing a strong excess in the center compared to 
the model. Progressively cutting the central region decreases 
the reduced^^. The fit becomes acceptable (x^^^ ~ 1) for a cut- 
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out radius of ^cut ~ 2.4' with a best fit /? value of 0.79 + 0.01 
(Icr errors). The reduced Jt'^ keeps decreasing with increasing 
Ocut until it stabilizes to a value of x^^^ ~ 0.7 for ^cut > 3'. 
There is an indication that /3 increases with increasing ^cut. btit 
the effect is marginally significant: for instance we obtained 
/3 = 0.85 + 0.05 for 6lcut = 3.9'. 

We then considered the alternative parametrisations of the 
gas density profile proposed by Pratt & Arnaud ( 2002| for cases 
where a central excess is seen. We fitted the entire profile with 
a cusped model, similar to the NFW profile (their AB model) 
and a double isothermal yS-model (their BB model). Both mod- 
els fail to account for the data, although formally the latter pro- 
vides a better fit than the former: the reduced are respec- 
tively 5.5 and 4.2. We then tried a generalized double yS-model 
(their KBB model). This model allows a more centrally peaked 
gas density profile in the core than the BB model and is defined 
by: 



r < Rent n^ir) = ne(0) 
r > Rent n^ir) = 



2f 



(1) 



where ^, r^, Rcut, and p are free parameters, the parameters 
and Pi being related to them so that both the density distribution 
and its gradient are continuous across Rent- 

This model provides an excellent fit to the data, but only 
if the very central part is excluded from the fit (see Fig. |2j- 
Fitting the {6 > 0.33' ~ 32 kpc) region gives a reduced of 
Xl^ ~ 0.92 for^ = 0.42, r„ = 0.99', R,,t = 4.5', = 2.5' and 
y6 = 0.8 1 . A clear excess is observed when extrapolating this 
model in the central part (see Fig.|5J- 

We then tried a 3 component model. We went back to direct 
parametrisation of the emission measure profile (rather than the 
density profile) for simplicity. We considered the sum of three 
y6-models (hereafter BBB model). 



EM{r) 
EMi(r) 



EMi(r) + EM2(r) + EM3(r) 



EMi(O) 



-3P+1/2 



(2) 



A common /? value is assumed to insure a smooth (single power 
law) behavior at high radii. The model has thus 7 free parame- 
ters. The corresponding density profile is: 



(3) 



where each ne/('") profile is the density jS-model profile corre- 
sponding to the emission measure profile £'M,(r). 

This BBB model provides an excellent fit to the data over 
the whole radial range: the reduced x^ is X^^a ~ ^-^^ ^'^^ -'^ 
d.o.f. This best fit model is plotted in Fig. |3l together with 
the ratio between data and model. The best fit parameters are 
P = 0.84+°;04, = 0.25' + 0.02', = 1.12' + 0.07', 

= 3.3' + 0.3' (90% errors). The best fit central density is 
«e(0) - 0.109cm"'', with a relative normalization of the sec- 
ond and third components of (ne2(0)/ne(0))^ = 5.40 10"^ and 
(«e2(0)/ne(0))^ = 1.32 10"^ respectively. 

We further compared this best fit density profile with the 
profile obtained by Sun et al. ( >2003j from a deprojection 
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Fig. 4. Comparison between the Chandra gas density profile de- 
rived by Sun et al. C200T) and the XMM-Newton best fit BBB 
gas density model (as plotted in Fig. |3}- Bottom panel: ratio 
between the Chandra data and this model. 



of the Chandra ACIS-S3 surface brightness (see Fig. 1^. The 
Chandra profile is determined out to ~ 5'. There is a good 
agreement between the two profiles, both in shape and in 
normalization. Simply adjusting the overall normalization of 
the XMM-Newton best fit model to the Chandra data gives al- 
ready a reduced of 1.3 with residuals between model and 
data less than 5%. The normahzation is 2% lower than the 
XMM-Newton value, corresponding to ~ 4% discrepancy in 
X-ray flux, well within the discrepancy of ~ +5% between 
the two instruments found by Snowden J2002> . Although the 
best fit XMM model does not fit perfectly the Chandra profile 
shape, slightly adjusting the parameters provides an acceptable 
fit. We kept the p and outer core radius values to their best fit 
XMM values, as the external shape of the density profile is not 
well constrained by Chandra data. We obtained a reduced of 
1.0 for - 0.26' and - 1.18', consistent with the XMM 
90% confidence range. The relative normalizations of the three 
components are marginally inconsistent. Nevertheless, taking 
into account that the effect of the Chandra PSF is negligible, 
the good agreement between XMM and Chandra central core 
radius indicates that the PSF modeling we have used to fit the 
XMM-Newton data is basically correct. 

The XMM best fit BBB model is thus used in the following 
to estimate the cluster gas and total mass profiles and to cor- 
rect the temperature profile for PSF and projection effects. The 
total mass profile depends on the logarithmic slope of the den- 
sity profile. To estimate the systematic uncertainties in the mass 
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Table 1. A478 global properties from previous studies 





Exosat " 


Ginga/Einstein ^ 


ROSAT/Ginga " 


ASCA ■' 


Chandra 


XMM f 


kT (keV) 

Nil (lO^'cm-^) 

Z(Zo) 


0.27^1^ 


3.6!gi 
0.37-- 


6 56+" ''* 

"•-'"-0.14 

7 49+0.12 

"■■^^-0.04 


6.58;j5;g 

3.0 8 

3J+0.03 

"■-'^-0.03 


1 1 s+o.ii 

9 j:q+0.03 
^•-^'-0.03 

37+°-"^ 

"■-"-0.02 


6 17+"'- 

"•^ '-0.06 
^•""-0.03 

32+°-'" 

"■-'^-0.01 



^Edge & Stewart fT99n ''Johnstone et al. fT992l Allen et al. fT993l and White et al. [19941 ''White l2000l "^Sun et al. l2003l This work. 
8 fixed parameter. 

Note: all studies include the cooling core regions. 



estimates we will also consider the density logarithmic slopes 
derived from the BBB model best fitting the Chandra profile. 
The differences are small however, in the range 0.2 - 4.5%. 
Finally we would like to emphasize that the BBB model func- 
tional form must only be viewed as a convenient parametric 
representation of the gas density profile. It has no particular 
physical ground and must not be over-interpreted (e.g in terms 
of three distinct gravitational systems in the cluster). 

5. Temperature and absorption profiles 

Throughout the analysis, the spectra are binned to reach 
a significance level of at least 3cr in each bin. We used 
xsPEC to fit the data with an absorbed redshifted thermal 
model (wabs(mekal)). Due to larger calibration uncertain- 
ties in the instrument response below the O edge we only 
fitted the spectra above E - 0.6 keV. We used the fol- 
lowing response matrices: ml_thinlv9q20t5r6 jlLlS.rsp 
(EMOSl), m2.thinlv9q20t5r6jlL15.rsp (EMOS2) and 
epn_ef20_sY9_thin.rsp (EPN, created in December 2002). 
Unless otherwise stated, the relative normalizations of the EPN 
and EMOS spectra were left free when fitted simultaneously. 

5.1. Global spectrum analysis 

We first extracted the cluster EMOSl, EM0S2 and EPN spec- 
tra within a circular region of 10'. Fitting simultaneously the 
EMOS and EPN spectra, we obtained a redshift of z = 0.0868 + 
0.0004. This value is significantly smaller than the optical value 
(z - 0.0881 + 0.0009). An investigation of the variation of 

with z revealed two local minima, one at the optical red- 
shift location and one at z = 0.079. Independent spectral fits 
of EMOS and EPN spectra clarify the issue. The EMOS best 
fit value is z = 0.0889 + 0.0010, perfectly consistent with the 
optical value. On the other hand, the EPN best fit redshift is 
z = 0.0793 + 0.0007 and corresponds to the second minimum. 
The redshift difference from the optical value corresponds to an 
energy shift of AE - +50 ± 4 eV with respect to the expected 
iron line position. 

We thus checked for a possible gain problem in the EPN 
data. The non-background subtracted spectrum shows strong 
Al-K, Ni-K and Cu-K fluorescence lines, which can be used 
for this purpose. We fitted the spectrum extracted in the 6' < 
6 < 12' region in a restricted energy range around these lines 
with a power law plus Gaussian hne(s). The fitted Ni-K and 
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Fig. 5. Result of the radial spectral fitting. From top to bottom 
the temperature (kT), Galactic absorption (A^h), metallicity (Z) 
and reduced for the best fit model. Errors are Icr. For the 
temperature profile the best fit model using Eq. @ is over- 
plotted as a dashed line. 

Cu-K line energies are significantly higher than expected: E - 
7517+6 eV and £■ - 808 1 ± 1 .5 eV respectively, to be compared 
to the expected values of 7477 eV and 8047 eV. However, the 
discrepancies, AE - 40+6 eV and AE = 34±2 eV respectively, 
are smaller than observed for the cluster iron line. Furthermore 
we cannot simply add a constant offset to the energy scale: the 
centroid energy of Al-K, E = 1489^3 eV, is consistent with the 
expected value of 1487 eV. Any hnear gain correction based 
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on the position of these fluorescence lines would thus be insuf- 
ficient to bring the EPN redshift determination into agreement 
with the optical value. 

We thus did not try to define and apply a gain correction 
In principle, the EPN gain uncertainty could affect our results. 
To assess this point, we fitted the EPN spectrum both fixing the 
redshift at the optical value and letting this parameter free. The 
derived A^h values are the same, the best fit temperatures differ 
by 0.10 keV, similar to the statistical uncertainty. As expected 
the main impact is on the derived abundance: for a free redshift 
Z = 0.34 + 0.02 Zq, only marginally consistent with the value, 
Z = 0.30 ± 0.02 Zq, obtained fixing the redshift at the optical 
value. When the EPN spectrum is fitted simultaneously with 
the EMOS spectrum, these discrepancies are even smaller The 
abundance difference is two times less and the temperature is 
the same. 

As we are mostly interested in the temperature informa- 
tion, we can neglect the gain uncertainty and choose to fix the 
redshift to the optical value, in all the following analysis. The 
overall cluster parameters are: A^h = 2.66^nn? x 10^' cm"^ 



6.17+° i2keV andZ 



90% confidence level). 
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5.2. Annular spectra analysis 

We extracted background-subtracted, vignetting-corrected 
spectra in 13 concentric annuli centered at the peak of the X- 
ray emission. The annuli were defined to have about the same 
number of counts per bins (except the outermost annulus). 

The EMOSl, EMOS2 and EPN spectra of each annulus 
were simultaneously fitted with a wabs(mekal) model. The re- 
sulting A^H, kr, and abundance profiles are plotted in Fig. |5] 
The definition of the annuli and the best fit parameters are gath- 
ered in Table 12 We also checked that the temperature profiles 
obtained by fitting the EMOS and EPN spectra independently 
are consistent within the error bars. 

The temperature profile shows a clear drop towards the cen- 
ter. It is well fitted by the analytical formula proposed by Allen 
et al. J2()()lb> : 

(r/r,)" 
1 +(r/rc)" 

with To = 3.26 keV and Ti = 3.52 keV, = 0.396' and 
J] - 1.52 (x^ - 12.7 for 13 degrees of freedom). The best fit 
model is plotted in Fig.|5] To fit the observed profile with such a 
formula, we had to assign a radius to each annulus temperature. 
Following the prescription of Lewis et al. ( 2003 1, we used the 
weighted effective radius of each annulus, defined as: 



T(r) = To + Ti 



(4) 



n = [(> 



3/2 



.3/2 



)/2] 



2/3 



(5) 



rather than the mean radius. We checked that the best fit tem- 
perature profile then becomes insensitive to the binning choice. 
We regrouped the first two annuli and then the next two annuli 
and re-ran an isothermal fit to the spectra of those new larger 
annuli. The resulting profiles were fitted again with Eq. (0}, us- 
ing the weighted effective radii. We obtained the same best fit 
profile as with the original binning. This is not the case if we 
use instead the mean radius of each annulus. 



The temperature profile shows a strong gradient towards 
the center, whereas we recall that the surface brightness profile 
is very peaked. This temperature profile is thus likely to be af- 
fected by both PSF and projection effects. These effects will be 
analyzed in Sect. 15.51 However, the overall A^h value is signif- 
icantly higher than the 21 cm value and its radial profile is not 
flat. As the A^h and kr determination are not independent, we 
will first discuss our absorption results. 



Table 2. Radial spectral fitting - best fit values and associated 
error bars (90% confidence level) 





Ah 


kT 




z 


xL (d-o.f) 


(arcmin) 


(lO^'cm--) 


(keV) 




(Zo) 




0.14 


3.00±0.10 


3.60+0.10 





43+0.05 


1.04(1140) 


0.41 


2.97+0.05 


4.54+0.07 





38+0.02 


1.08 (1979) 


0.69 


2.85±0.05 


5.60+0.10 





32+0.02 


1.03 (2111) 


1.02 


2.81+0.05 


5.88+0.11 





31+0.02 


1.09 (2135) 


1.40 


2.82±0.05 


6.21+0.14 





32+0.02 


1.03 (2052) 


1.84 


2.81+0.05 


6.32+0.16 





28+0.02 


1.02 (1937) 


2.34 


2.77+0.06 


6.50+0.17 





28+0.03 


1.01 (1926) 


2.94 


2.68+0.06 


6.67+0.20 





28+0.03 


1.04(1736) 


3.66 


2.54+0.07 


6.80+0.25 





24+0.04 


0.97 (1559) 


4.54 


2.41+0.08 


6.91+0.30 




0.26'' 


1.00(1405) 


5.64 


2.39+0.10 


6.53+0.39 




0.26'' 


0.98 (1153) 


7.01 


2.01+0.14 


6.46+0.58 




0.26'' 


0.88 (843) 


12.0 


1.68+0.22 


7.04+1.29 




0.26'' 


0.90 (443) 



' External radius of the annuli. '' fixed parameter. 



5.3. The absorption profile 



Our best fit overall value for the Galactic absorption, A^h = 
2.66;';qq3 X 10^' cm"^, is nearly two times the 21 cm value 
of A^H = 1.53 X 10^1 cm-2 (Dickey & Lockman [T990l . Such 
an excess absorption was found in all previous X-rays studies 
of A478 and our derived value is only marginally higher than 
the value derived from ROSAT/Ginga and Chandra data (see 
TableQ. 

The radial A^h profile that we obtained exhibits a clear gra- 
dient ranging from 3x10^' cm"^ in the central regions to 1 .7 x 



10 cm at 12'. This gradient is consistent with the Chandra 
gradient measured by Sun et al. (.2003) : from a central value of 
2.9 ± 0.1 X 10^' cm-2 down to 2.4 + 0.1 X 10^^ cm^^ for the last 
bin at ~ 5' (perfectly consistent with the XMM-Newton value, 
see Table IS. 

This excess of absorption seen in A478 and other cooling 
flow clusters was interpreted in previous studies (e.g. Allen 
et al. 1993 2001 a l in terms of intrinsic absorption by very 
cold gas related to the strong cooling flow. However our vi- 
sion of cool cluster cores has dramatically changed due to 
XMM-Newton observations. The standard cooling flow model 
predicts low energy emission lines which are simply not seen 
in the RGS spectra (Peterson et al. 120011 12003). This stan- 
dard model is also inconsistent with EPIC data (e.g Molendi 
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Fig. 6. IRAS 100 yum (Schlegel et al. [D^ in A478 direction. 
The overplotted contours show the X-ray emission as well as 
the area covered by our XMM-Newton observation. The vertical 
and horizontal lines defined the sectors used to cross-check the 
A^H profiles (see text). 
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Fig. 7. A^H profiles derived from XMM-Newton data in the 
North-East (diamonds), North-West (triangles), South-West 
(squares) and South-East (crosses) quadrant compared to the 
azimuthal profile (smoothed - solid line). The two dashed lines 
are the 3cr error bars (smoothed) associated to the azimuthal 
profile. The two dotted-dashed lines correspond to the A^h value 
estimated from the 100 lum IRAS map according to Boulanger 
et al. (upper line - I1996> and Schlegel et al. (lower line - I1998> . 



& Pizzolato IMJTl Matsushita et al. IMHI Bohringer et al. 
120021 Kaastra et al. l2003> . No evidence of intrinsic absorption 
was found with XMM-Newton in the center of the cooling flow 
regions in M87 and the Perseus cluster, and Bohringer et al. 
J2002> argued that the excess absorption measured by previous 
missions is an artifact of fitting standard cooling flow models. 
The low energy emission over-predicted by this model can be 
artificially suppressed by adding an extra absorption compo- 
nent when fitting spectra obtained with instruments like ASCA, 
which have relatively low sensitivity at low energies. 

Although the absorption excess in A478 is confirmed by 
XMM-Newton data, it is more likely, in view of our current 
knowledge of cooling cores in clusters, that all the absorption is 
of Galactic origin. This hypothesis is reinforced by the spatial 
distribution of the excess absorption: the excess extends well 
beyond the cool core region. We also note that if cold gas has 
indeed now been detected in the core of clusters, like A478, 
through CO measurements (Edge, 2001), there is still a large 
mismatch, by an order of magnitude, between the inferred col- 
umn densities and the absorption excess (see Edge 2001, for 
full discussion). Finally the local CXRB that we measure with 
XMM-Newton is lower than the average blank field value (see 
Sect. El). T he Ros at All Sky Survey (RASS) 3/4 keV maps 
(see Snowden'1997'l also clearly show a deficit of CXRB in that 
region. This again points towards a high Galactic absorption. 

To further test the origin of the absorption and the robust- 
ness of our A^H estimates, we considered other indicators of the 
Galactic gas column density. We recall that the column den- 



sity derived from X-ray data is actually the total hydrogen col- 
umn density (assuming standard abundances). We considered 
the IRAS 100 yum cleaned map of Schlegel et al. (IWl. The 
Galactic hydrogen is correlated with the Galactic dust respon- 
sible for the IR emission, as shown by Boulanger et al. J1996t 
and Schlegel et al. (TWi, who correlated COBE/DIRBE and 
IRAS data with the Leiden/Dwingeloo survey (Hartmann & 
Burton [T997t . The correlation between the IR emission and 
the atomic hydrogen column density is determined from low 
A^H data (A^h < 4.6 x 10^" cm"^). Above this threshold an in- 
creasing dispersion is observed with higher IR/HI ratio on av- 
erage. Boulanger et al. (1996) argued that this excess IR emis- 
sion is due to dust associated with molecular Hydrogen. The 
IR emission could thus actually be a tracer of the total hydro- 
gen content. Assuming that the correlation determined at low 
A^H values (where the H2 fraction is expected to be small) is 
representative of this IR - total A^h correlation, we converted 
the IR brightness map into a total A^h map. We used both the 
Boulanger et al. ( .1996) and Schlegel et al. ( 1998) results, these 
two groups having derived slightly different correlation coeffi- 
cients. We then derived radial profiles which are compared with 
the XMM-Newton derived A^h profile in Fig.0 Interestingly, the 
IR and X-ray derived A^h profiles are indeed found to be con- 
sistent up to about 5'. 

Beyond that radius the XMM-Newton Nh profile starts to 
become significantly lower than the expected values from the 
IR emission. However, the IR emission shows a strong gra- 
dient over the cluster area in the north-east/south-west direc- 
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tion (see Fig. |6} . There is a drop by a factor of two between 
the north-east sector of our mosaic and the south-east sector. 
Obviously the azimuthal average tends to smooth the gradient 
effects. Keeping the previous definition of annuli, we then di- 
vided each annulus in four sectors separated by a North-South 
and an East- West axis (see Fig. |6j. In each sector, we ran a 
spectral fit for each annulus. The resulting A^h profiles are pre- 
sented in Fig. All four profiles are compatible with the az- 
imuthally averaged profile within a 3cr limit. One can notice, 
however, that the South-East measurements are systematically 
lower than the azimuthal values whereas the North-West values 
are systematically larger 

In summary, both X-ray and IR data indicate a higher 
Galactic hydrogen column density than the 21 cm value. Both 
X-ray and IR A^h estimates agree remarkably well in the clus- 
ter center, suggesting that the excess absorption is indeed of 
Galactic origin. Moreover, FIR observations from the ISO 
satellite at 90 and 1 80 fim show a color ratio favoring a cold 
temperature structure (Pointecouteau & Giard, in preparation), 
which is more likely to be due to a Galactic structure than to 
an intracluster dust component. Indeed, the expected temper- 
atures for the intracluster dust according to the current mod- 
els are > 20 K (Montier & Giard 2003 1. However, the radial 
variations of the X-ray and IR derived hydrogen column den- 
sity differ This could be due to variations in gas to dust ratio 
and/or metallicity for instance. To probe the foreground struc- 
ture on the cluster scale toward A478, FIR observations with 
a higher spatial resolution would be extremely useful. For in- 
stance, soon the ASTRO-F mission ( Shibai 2002 1 will survey 
the whole sky at FIR wavelengths and will provide observa- 
tions up to 200 i-im that will reveal the galactic cold component 
structure. These upcoming observations on the whole cluster 
scale will certainly help to clarify this issue. 

In view of the discrepancy between the IR and X-ray de- 
rived A^H values beyond 6 ~ 5', we further checked the ro- 
bustness of our A^H and thus kr measurements in that re- 
gion (two outer annuli). We ran again the individual spec- 
tral fit on the annuli, fixing the value of the Galactic absorp- 
tion to 2.5 X 10^' cm"^. The derived temperature drops from 
~ 7 keV down to ~ 5 keV. However, the fit is significantly 
worse. Indeed, the F-test probability, given the value and 
the one obtained previously by letting free the A^h, is 10"**. 
Furthermore, if we still fix the A^h value to 2.5 x 10^' cm"^, 
but fit the spectra for £ > 1 .5 ke V, avoiding the low energy 
band that is sensitive to the Galactic absorption, the derived 
temperature profile is fully compatible with the nominal one. 
Therefore, we can be reasonably confident that the derived ra- 
dial column density and temperature profiles are real. 

5.4. Comparison with previous results 

The best fit values we derived for the overall physical proper- 
ties of A478 are in agreement with the previous results for the 
Galactic absorption and the metallicity (see Table[0. Moreover, 
their radial profiles match closely those derived by Sun et al. 
J200 3') using the Chandra data. However, our overall tempera- 
ture value is marginally compatible with the temperature val- 



ues from ROSAT/Ginga and ASCA. A478 does not exhibit an 
overall isothermal plasma (presence of a cool core). The tem- 
perature derived from an isothermal fit is actually an emis- 
sion weighted temperature, which depends on the instrument 
response. Due to its higher sensitivity at low energies, XMM 
is more sensitive to the presence of a cool component. This 
would explain the slightly lower temperature derived from our 
data with respect to Ginga and ASCA results. To further check 
this point, we fixed the absorption value to 2.5 x 10^' cm"^ (the 
value derived from the ROSAT/Ginga analysis) and we fitted 
the overall spectrum over the [1-10] keV energy band. The best 
fit temperature value is then kT = 6.42 + 0.06 keV, a value 
compatible with ROSAT/Ginga value, as well as ASCA value. 

Despite this agreement, some important discrepancies 
appear between the temperature profiles obtained from 
XMM-Newton and Chandra. If we focus on the average 
value of the temperature excluding the cool core region, 
the Chandra value (~ 8.5 keV) is significantly higher than 
the XMM-Newton value (~ 6.5 keV). Similar discrepan- 
cies appear for the luminous X-ray cluster PKS 0745-191. 
Indeed, the temperature derived from the Chandra analysis 
JHicks et al. 200 2>) outside the cool core (1.5' < r < 2.3'), 
~ 10.5 keV, is significantly higher than the value derived from 
XMM-Newton data ( Chen, Ikebe, & Bohringer 2003 ) in the 
same region, ~ 7.5 keV. In this case, the XMM-Newton result 
compares better with the value of ~ 8 keV by BeppoSAX 
(De Grandi & Molendi 1999). However, it must be noted that 
the BeppoSAX temperature measurement includes the cold 
core region which is likely to induce a bias toward lower tem- 
peratures. 

We have failed to explain the discrepancy between 
XMM-Newton and Chandra. Apart from calibration related 
problems, we thought that it could be due to background sub- 
traction problems. For both PKS 0745-191 and A478, the 
CXRB was found with XMM-Newton to be different from 
the CXRB of a typical blank field. The higher A^h observed 
in the direction of the A478 cluster certainly contributes to 
this difference for this cluster, but we cannot exclude a con- 
tribution from some intrinsic spatial variation in the soft X- 
ray Galactic emission. This difference was taken into account 
in the XMM-Newton background subtraction procedure (in the 
second subtraction step - see Sect, \2.4i . On the other hand, 
Chandra analysis had to rely on a simple blank field back- 
ground subtraction, by lack of data at large radii. In the Chandra 
analysis of A478, the CRXB is a priori oversubtracted and 
this could bias the temperature determination, especially in the 
outer cluster region where the CXRB count rate is no longer 
negligible with respect to the cluster count rate. To test this 
hypothesis for A478, we perform a single blank field subtrac- 
tion for each XMM annular spectrum and re-ran the spec- 
tral isothermal fit. The resulting absorption and temperature 
profiles become significantly different beyond 5' (the upper 
limit of the Chandra profile). However, below ~ 5', the pro- 
files are not significantly affected and remain inconsistent with 
Chandra values'. Although the Chandra blank field observa- 

' Our background subtraction procedure assumes that the CRXB 
does not vary within the field of view. The low value of the local 
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tions are not the same as those of XMM-Newton, it is thus un- 
likely that the background subtraction issue is an explanation 
of the discrepancies. Furthermore the agreement between the 
XMM-Newton and Chandra profiles for the Galactic absorption 
would be puzzling if that was the explanation, since the A^h 
determination is particularly sensitive to the subtraction of the 
residual CXRB, which affects the low energy part of the spec- 
trum most. 

5.5. The cluster temperature profile: correction of PSF 
and projection effects. 

The central drop in the temperature profile emphasizes the need 
to perform a deprojection analysis and to take into account the 
PSF effect. For such a highly peaked cooling flow cluster, PSF 
effects are important in the center where we have chosen nar- 
row bins for the temperature computation to recover the best 
temperature profile. 

5.5.1 . PSF and projection effect modeling 

The incident emission of each annulus, /, is the projected sum 
of the emission from various shells, j > i. This emission is 
then redistributed among various annuli due to the finite PSF. 
In principle, we should apply an absorption model to each in- 
cident annular spectrum (i.e. after the projection and before the 
PSF convolution). However the A^h profile derived from the an- 
nular spectra fit does not show strong gradients in the central 
region (where the PSF effect is significant). We thus used a sin- 
gle absorption model for each observed annular spectrum, fix- 
ing the A^H value to the best fit value obtained from individual 
annular fits (see Table|5J. 

We thus model the observed annular spectra, S'^iE), with a 
linear combination of isothermal mekal models (normalized to 
the unit emission measure), multiplied by a wabs model: 

n 

S'^(E) - WABS(A^H,) ^ a,jMEKAL(rj, Zj) (6) 
J=l 

We first considered pure PSF effects. In that case, each a,j 
coefficient is the emission measure contribution of the ring j 
to the ring /. The fitted temperatures Tj can be considered as 
'PSF corrected' projected temperatures. The a, ^ factors were 
derived from our best fit gas density profile (BBB model), con- 
verted to an emission measure profile and convolved with the 
XMM-Newton PSF estimated at 1 keV. To crudely validate our 
absorption modeling, we ran this PSF correction fit leaving the 
A^H as a free parameter. We found completely consistent values 
with the best fit value of the annular spectral fit. 

CRXB (as compared to blank field value) may partly be due to a higher 
galactic absorption (Sect. l5.3V We thus cannot exclude that the CRXB 
actually varies within the FOV, in view of the observed A'h variations 
(see Fig.0. However, this further test shows that the double subtrac- 
tion is really needed only for the cluster outskirts (5' - 12'), closest 
to the region chosen to compute the local background (r > 16'). This 
should minimise any artifact due to our neglect of possible background 
radial variation. 



Similarly we assess the projection effects, neglecting the 
PSF blurring. We used the same formula with the ciij redistri- 
bution factors being now the emission measure contribution of 
the shell j to the ring / and T j the temperature of the shell j 
(assumed to be isothermal). 

Finally we took into account both effects, using as Uij the 
emission measure contribution of the shell j to the ring / after 
convolution with the PSF. 



5.5.2. Simultaneous fits of annular spectra 

The fitting was done with xspec We have to take into ac- 
count that xspec can only handle 1000 parameters (even if 
most of them are frozen). The EMOSl, EMOS2 and EPN 
spectrum of each annulus was loaded into xspec as a data 
group. The same model parameters are applied to each spec- 
trum of a given group. Therefore, EMOS and EPN spectra have 
to be normalized in order to be fitted with the same normal- 
ization. Furthermore the a,- y coefficients are computed with- 
out taking into account flux loss due CCD gaps, bright pixels, 

etc We therefore first renormalized each spectrum by the 

ratio of the annular geometrical area to the actual extraction 
region surface (backscale value). After this coiTection, the ra- 
tio of the EMOS and EPN normalizations obtained from the 
annular fits (Sect. 15. 2> were found to be consistent with the 
ratio obtained by fitting the overall EMOS and EPN spectra: 
Nemos I EPN - 1.11. We thus applied this factor to all EPN 
spectra. Finally, we checked that the annular fit results indeed 
remain the same: the differences in derived kT are negligible 
compared to the statistical errors. 

Fitting simultaneously n annular spectra with a sum of n 
mekal models, multiplied by a wabs model, gives a total num- 
ber of parameters of (6n + \)n. We thus have to limit n to 12. 
To overcome this problem, we have used two different sets of 
12 annular spectra. The first set is obtained by grouping the last 
two annuli into a single annulus and the second set by group- 
ing the first two annuli. We then combined the first set results 
for annuli #1 to #9 with the second set results for annuli #10 to 
#13. 

For each set, the free parameters are the 12 temperatures, 
12 normalizations, one per data group (annulus), the other nor- 
malizations being linked according to Eq. (|6j. In practice we 
ignore all contributions less than 1%. We have frozen the abun- 
dance of each mekal model to the annular best fit value, except 
for the 4 innermost components corresponding to the r < 1' re- 
gion. Beyond that region the annular abundance profile is flat. 
PSF and projections effects are unlikely to affect the abundance 
determination in such a way that it has a significant impact on 
the temperature determination. To further check this point, we 
also fixed all the abundances to the best fit annular value. Only 
the central temperature is changed significantly. 

The resulting corrected temperatures are plotted versus ef- 
fective radius (defined in Eq.|5} in Fig.|8l These data are com- 
pared to the temperature profile derived in Sect. 15. 21 (hereafter 
the raw temperature profile). 
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Fig. 8. Temperature profiles corrected for the PSF effect (left panel), deprojected (middle panel), PSF corrected and deprojected 
(right panel). The data points are the temperatures derived from simultaneous fitting of the annular spectra. The open points are 
the temperatures obtained by fitting each annular spectrum individually with an isothermal model (Same as in Fig.|5jl. The dotted 
line is the analytical model fitted to these data (Eql^. The full lines are the PSF corrected model (left panel), the deprojected 
model (middle panel) and the PSF corrected deprojected model (see Sect. l5.5.3l for details). 



5.5.3. Correction of the temperature profile model 

We also considered the best fit model (Eq.|^ of the raw temper- 
ature profile (dotted line in Fig.|8|i. It can also be corrected for 
PSF and/or projection effects, assuming that the annular tem- 
peratures are emission-weighted temperatures: 



7=1 



(7) 



The factors Oij are the same as computed above. The PSF- 
corrected model, the deprojected model and the PSF-corrected 
deprojected model are plotted in Fig.|S](full lines). 



5.5.4. Results 

First, it is instructive to consider the PSF correction (Fig.|8j left 
panel) and the deprojection (Fig.|S] middle panel) separately. In 
both cases, the corrected model fits reasonably well the corre- 
sponding corrected temperature profile derived from the spec- 
tral fit. However, while the former remains a smooth function of 
radius, the later is more noisy. A comparison of the observed 
and model profiles before and after correction shows that the 
correction process amplifies any variation of the raw tempera- 
ture profile compared to the smooth model (see in particular the 
temperatures of annuli # 3, #10 #1 1 and #12). This is probably 
linked to the well known problem of noise ampUfication when 
deconvolving or deprojecting noisy data (see also Kaastra et al. 
12003 1 for a discussion on PSF/projection effects). 

The PSF-corrected projected temperatures and the depro- 
jected temperatures are consistent, within the error bars, with 
the raw temperatures in the external part of the cluster (nearly 
isothermal region, R > 100 kpc). Significant deviations are 
seen for the first four annuli. The general effect of the PSF 
blurring and of the projection are best seen by comparing the 



corrected and uncorrected models. The PSF affects mostly the 
central bin (the temperature of which is increased due to con- 
tamination by the higher temperatures of the external bins). In 
turn the other bins are contaminated by the low temperature 
central bin, and their observed raw temperatures are slightly 
lower than the incident ones. The effect is small however and 
consistently tends to zero with increasing radius. The main ef- 
fect is the projection effect. As expected it damps down the gra- 
dient in the central region. The deprojected temperatures are al- 
ways smaller than the projected ones, the effect increasing with 
decreasing radius. 

The PSF-corrected deprojected temperature profile is 
shown in Fig. |S] (right panel). When both PSF and projection 
effects are taken into account the noise amplification is dra- 
matic. The temperature profile derived from the spectral fit 
shows strong discontinuities (e.g. bins #3,4,5, bin #10). On 
the other hand, the corrected model remains smooth. Although 
the corrected temperatures derived from the spectral fit are not 
consistent with the corrected model within the statistical er- 
rors, they are distributed around it. The largest deviations cor- 
respond to the discontinuities mentioned above and are again 
clearly located around the bins which originally deviate most 
from the smooth model profile. These discontinuities are thus 
very likely non-physical and the corrected model is probably a 
better representation of the true cluster temperature profile than 
the profile derived from the spectral fit. As discussed in Sect.|6| 
we will use this corrected model as a reference in our compu- 
tation of the mass profile. 

The PSF-corrected deprojected model profile is consis- 
tently intermediate between the PSF-corrected model profile 
and the deprojected model profile for bins #2 to #13. As shown 
above, the PSF and projection have opposite effects in that re- 
gion. The model temperature of the central bin is however ex- 
tremely low. The PSF correction and deprojection was done 
assuming that the annular temperatures are emission-weighted 
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temperatures. This assumption is probably less and less valid 
with decreasing temperature and the low value we derive might 
be an artifact of our assumption. Furthermore the gas may be 
multiphase in that region (r < 20 kpc) due to the interaction 
of the intracluster medium with the central cD galaxy (see also 
below). 

6. Mass profile 

6.1. Calculation of the total mass profile 

The total gravitational mass distribution shown in Fig. |5] was 
calculated under the usual assumptions of hydrostatic equilib- 
rium and spherical symmetry using 



M(r) 



kT r 
Gfinip 



dlnn 



B din T 

+ 

dlnr dlnr 



(8) 



where G and nip are the gravitational constant and proton mass, 
and /i = 0.609. The mass (with errors) at each radius was 
calculated with the Monte Carlo method described in Pratt & 
Arnaud (2003 1, which takes as input a parametric model for the 
gas density profile and a measured temperature profile with er- 
ror bars. A random temperature at each radius of the measured 
temperature profile is generated assuming a Gaussian distribu- 
tion with sigma equal to the Icr error and a cubic spline in- 
terpolation (between 3 adjacent points) is used to compute the 
derivative. Only 'physical' temperature profiles are kept, i.e. 
those yielding monotically increasing total mass profiles. In to- 
tal 5000 such profiles were calculated. 

At large radii, the errors on the derived mass profile are 
dominated by the statistical errors on the temperature profile. 
However we have to consider possible systematical errors, spe- 
cially in the central (R < 100 kpc) region, where the PSF 
and deprojection corrections introduce noise in the derived 
temperature profile (see Sect. 15. 5> . Only data beyond 30 kpc 
are considered: the temperature of the central bins is highly 
uncertain (see Sect. I5.5.4> . Furthermore, Chandra data have 
clearly revealed sub-structure in the central 6 ~ 20 kpc = 
0.2' region. Below that radius, the hot thermal gas might in- 
teract with the radio halos, producing a non-thermal popu- 
lation of electrons, and therefore the hydrostatic equilibrium 
might be disturbed locally producing possible multiphase states 
for the gas at this spatial scale (see previous work on M87: 
|B6hringer et al. (1995)' "Belsole et al. (200 1)| and PKS 0745- 
191: ,Chen, Ikebe, & Bohringer 2003i ). 

We first derived a reference mass profile. It was computed 
using our best fit BBB model for the gas density profile. For 
the temperature profile we used the PSF-corrected deprojected 
model profile. The profile derived from the spectral fit is too 
noisy to be used: the strong fluctuations observed are inconsis- 
tent with any underlying mass profile. It is difficult to assess the 
statistical errors on this model profile in an objective way. We 
used the errors derived from the spectral fit, which is probably 
a conservative approach. For bin #2 we also add as an error the 
difference between the fitted temperature and the model value 
(the latter might be affected by the too low value derived for 
bin#l). 
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Fig. 9. Integrated total mass distribution. Filled circle: The reference 
total mass profile obtained from the best fit BBB model for the gas 
density profile and the PSF-corrected deprojected model of the tem- 
perature profile (errors are Icr). Open circles: mass obtained from the 
deprojected temperature model profile; Triangles: mass obtained us- 
ing the deprojected temperature profile derived from the spectral fit. 
Crosses: mass using the Chandra gas density profile. 



We then computed the mass profiles obtained using alterna- 
tive temperature and density data, to assess the systematic un- 
certainties. For the gas density profile, we considered the best 
fit BBB model of the Chandra profile (crosses in Fig|9j. The 
differences between the derived mass profile and the reference 
profile are much smaller than the statistical uncertainties. 

We then considered systematic uncertainties due to the tem- 
perature profile determination. We have shown that PSF effects 
in the region considered here are less important than projection 
effects and that pure deprojection introduces much less noise. 
We thus also computed the mass profile derived from the de- 
projected temperature profile (neglecting the PSF blurring). We 
used both the profile derived from the spectral fit (triangles in 
FigEJ the deprojected model profile (open circles in Fig|9}- 
The latter is well within the eiTor bars of the reference profile, 
but the former differs significantly (see for instance the first 
point). Considering that these differences are likely to be rep- 
resentative of the systematic uncertainties due to the PSF cor- 
rection treatment, we add them quadratically to the statistical 
errors on the reference mass profile. 

In the following, we will thus consider the reference mass 
profile with these eiTors bars. 

6.2. Modeling of the total mass profile 

We first tried to fit the data with a King model, where the mass 

r 9i"3/2 
density profile is given by: p(r) oc 1 -h (r/rs) . This model 

(dashed line in Fig. I10> is inconsistent with our data, the is 

= 16.5 for 10 d.o.f. 
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Fig. 10. Modelling of the integrated total mass distribution. Filled cir- 
cle: total mass profile as in Fig.|9| The errors now include systematic 
errors due to the PSF correction (see text). The solid line is the best 
fit NFW profile (c = 4.18), the dotted line is the best MQGSL profile 
and the dashed line is the best fit King model. 



The total mass profile was then fitted using cusped den- 
sity distributions: the Navarro, Frenk & White ( 1 997» pro- 
file (p(r) cc [(r/rs)(l + r/r^)^^ ) and the Moore et al. J1999> 

profile (p(r) oc |^(r/rs)-'^^(l -H r/rs)-'^^j ). These models have 
two free parameters, the central density and the scaling radius, 
or equivalently the total mass M200 (corresponding to a den- 
sity contrast of 200, as compared to the critical density of the 
Universe at the cluster redshift) and the concentration param- 
eter c - /?2()()/'"s. AH useful formulae relating these quantities 
can be found in Pratt & Arnaud (2002i. 

The NFW profile provides a good fit to the data: - 9.5 
for 10 d.o.f. The best fit NFW parameters are: = 492+71 kpc 
and /?200 = 2076 + 106 kpc, corresponding to a concentration 
parameter of c = 4.22 + 0.4 and a total mass enclosed within 
7?2(K) of M200 - 1.1 X 10'^ Mq . The previous errors are quoted 
at 68% confidence level. This best-fit NFW model is shown 
overplotted on the mass profile of the cluster in Fig. |5] The 
upper axis is the radius in units of the derived /?2oo. We are thus 
probing the dark matter shape on a scale from ~ 0.01 to ~ 0.5 
virial radius. 

The alternative MQGSL profile is rejected by our data: = 
29.7/10. It must also be noted that the deviations from the data 
are not only significant at low radii, where the mass estimate is 
most sensitive to systematic eiTors. There is a general deviation 
at large radius, where the model gives essentially a power law, 
while the observed profile shows a significant curvature. 
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Fig. 11. The integrated gas mass fraction as a function of the radius. 
The Icr error bars are obtained from the propagation of the errors over 
the total mass profile. The gas mass fraction profile computed from 
the best fit NFW model is plotted as a solid line. 

6.3. Gas mass fraction profile 

We derived the integrated gas mass fraction profile, /gas , from 
the ratio of the gas mass profile to the total mass profile. The 
gas mass profile is computed from the integration of the best 
fit BBB model for the density profile (see Sect. 14. 2t . The errors 
for the gas mass fraction are obtained from the propagation of 
the total mass uncertainties, the gas mass uncertainties being 
negligible. The resulting gas mass fraction profile is shown in 
Fig in together with the profile derived from the NFW best fit 
model. There is a general increase of /gas with radius, although 
the effect is small over the [0.01-0.5] 7? 200 range. The gradient is 
most pronounced in the cool core region (about 30%), beyond 
which there is only marginal evidence of a positive gradient 
(~ 10% from 0.1 to0.5/?20o). 

Excluding the cool core region (e.g. r < 2'), the average 
value for the gas fraction is fgaAr > 2') = 0.129 ± 0.008. We 
interpolated the gas mass fraction value at r25oo - 822 kpc, 
the radius coiTesponding to a density contrast of 6 = 2500 
(2500 times the critical density of the Universe at the cluster 
redshift), /2500 - 0.13 ± 0.02. This value agrees with the av- 
erage value of 0.113 + 0.013 at the same overdensity derived 



by Allen et al. ( I2002> from a sample of six massive clusters 
observed by Chandra. Our value is also consistent with the 
value derived from the analysis of A1413 (Pratt & Arnaud 



I2002t when scaled to the chosen cosmology of this paper: 
/250o(A1413) = 0.11. However, the A478 and A1413 /gas pro- 
files have different shapes, the latter decreasing more strongly 
toward the center. This reflects differences in the central gas 
density distribution, the total mass profile of both clusters hav- 
ing similar concentration parameters. Such a difference is prob- 
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ably linked to the different thermodynamical states of A1413 
and A478, the latter hosting a strong cooling core, contrary to 
the former. 

7. Discussion and conclusion 

In this paper we have analysed imaging and spatially resolved 
spectral data of the galaxy cluster A478 obtained with the 
XMM-Newton satellite. We obtained well constrained absorp- 
tion, gas density and gas temperature profiles up to ~ 0.5 virial 
radius. 

As in previous studies, we found an excess of absorption 
in the direction of A478. The derived absorbing column den- 
sity exceeds the 21 cm measurements by a factor of ~ 2 in the 
center and the excess extends well beyond the cooling core re- 
gion. This excess of absorption seen in A478 (and other cooling 
flow clusters) was interpreted in pre-XMM and Chandra stud- 
ies (e.g. Allen et al. 119931 IFabian 1994 t as the signature of 
intrinsic cool absorbing material, a consequence of the strong 
cooling flow in the cluster center. From the absorption excess 
extent and a detailed comparison with FIR data, we argue that 
the absorption excess is rather of Galactic origin. We suggest 
it could be the effect of a Galactic molecular/cold cloud type 
structure in the line of sight. The next generation of FIR space 
missions will help to clarify this issue with sensitive FIR map- 
ping of the whole cluster area with a high spatial resolution. 

We fitted the surface brightness profile with various para- 
metric models of the gas density profile, taking into account the 
XMM-Newton PSF. The gas density profile, derived on scales 
of 0.03' - 13', is highly peaked towards the center and is well 
fitted by a quadratic sum of three /3-model. The derived gas 
density profile is in excellent agreement both in shape and nor- 
malization with the Chandra density profile (measured up to 5' 
of the center). This indicates that the PSF modeling we have 
used is basically correct and that accurate density profiles in 
the very center of the cluster can be derived with XMM data, 
in spite of the PSF blurring. 

A raw temperature profile was obtained on scales of 0.07' - 
10' by fitting isothermal models to spectra extracted in 13 con- 
centric annuli. This profile shows a sharp negative gradient 
measured toward the center (r < 2'), a signature of a cooling 
core. Beyond that region the profile is essentially flat. We have 
thoroughly investigated projection and PSF effects on the tem- 
perature profile determination. The PSF effects beyond 0.3' are 
much less important than projection effects, whereas both are 
important in the very center. We discuss the noise introduced 
by the correction of these effects and a way to overcome this 
problem. The derived deprojected PSF-corrected temperature 
profile ranges from ~ 2 keV in the center up to an asymptotic 
value of ~ 6.5 keV. 

Using this temperature profile and the density profile, we 
have derived the total mass profile for this cluster from 0.01 up 
to ~ 0.5 times the virial radius. Systematic uncertainties due 
to the PSF and projection correction for the temperature profile 
are taken into account. We have tested different models for the 
dark matter profile distribution against the observed mass pro- 
file. A mass distribution with a cusp in the center, as predicted 
from numerical simulations, is clearly preferred. 



An isothermal sphere model does not provide a good fit to 
the data. In a second time we tested an MQGSL model and an 
NFW model. Those two types of models have similar shapes 
at large radii (they both scale like r"'-^) but differ significantly 
at small radii. Therefore to discriminate those two models one 
needs data with a high statistic quality over a wide range of 
radii (i.e. covering at least two decades). Our data set nearly 
fulfilled this requirement and we were indeed able to discrimi- 
nate between the two models, the NFW model being preferred 
to the MQGSL model. For the NFW model, we derived a con- 
centration parameter c - 4.2 + 0.4. This value is as expected 
from numerical simulations: c ~ 6 (Navarro, Frenk & White 
[T997I |^e~Navarro, & Frenk 19981 with a typical Icr disper- 
sion of A(log(c)) = 0.18 (Bullock et al. 12^11 '). This work 
can be compared with the similar work on the cluster A 14 13 
by Pratt & Arnaud (2002.,). In the case of A1413, if the NFW 
model was acceptable, the MQGSL model was slightly pre- 
ferred. Although this cluster is detected out to 0.7 times the 
virial radius, the data are limited in the center, a shortcoming, as 
emphasized above, for discriminating between those two mod- 
els. Moreover, data in the center only are not suflicient (see the 
work on A1983 by Pratt & Arnaud 2003 ). On the other hand, 
our result agrees with the analysis of A2029 by Lewis et al. 
(2003), which clearly favors an NFW dark matter profile. To 
our knowledge, this is the only other data set which covers a 
similar wide radial range (0.001 to 0.1 virial radius) . 

The key factors in fitting the mass profiles with different 
dark matter models, are the resolution in the center as well as 
the data at large radii. To date XMM-Newton is the best satellite 
to compute total mass profiles, especially through its capabiUty 
to derive precise temperature profiles. Nevertheless, its spatial 
resolution limits the investigation at the very center of galaxy 
clusters. A direct combined analysis of XMM-Newton and 
Chandra data of very well relaxed clusters seems to be an ideal 
path to a full description of the dark matter profile in clusters. 
However, one has to keep in mind that this requires an excellent 
cross calibration between the two satellites, so that the temper- 
ature profiles derived at various scales can be combined. 
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Appendix A: Extraction of vignetting-corrected 
products from merged event lists 

To correct for vignetting effects we used the photon weighting 
method, described in Arnaud et al. (i2001i . An estimate of the 
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vignetting-coiTected number of counts in a given sky region 
Reg and in a given energy band - Emax is the sum: 



C 



(A.l) 



over all events j with sky position (xj,yj) € Reg and energy 
Ej € [£„,/« - E „,„■(]. The weight coefficient wj is the ratio of 
effective area at the event position to the central effective area 
computed at the event energy. This count extraction is the same 
for individual or merged events lists. We computed the weight 
coefficients by running the task evigweight on each individual 
events list (this task can also be run on the merged events lists). 

Count rate estimates are less straightforward, since the ef- 
fective exposure time can strongly vary in the extraction region 
(from 1 1 ksec in regions only observed with the offset point- 
ings up to 60 ksec in the overlapping region). However, the 
total count rate in a given region is simply the sum of the count 
rates in various sub-regions. The count rate can be written as 



(A.2) 



where t(xj,yj) is the exposure time at the event location. 

In practice we used the following convenient procedure for 
each camera data set: 

1 - We created a mosaic exposure map of the two pointings 
in sky coordinates. The reference position is the same as for 
the merged events list. The exposure map takes into account 
detector regions excluded by the events selection criteria (bad 
pixels, CCD borders . . . ). We used a pixel size of 1.1" x 1.1" 

2 - After merging the events lists, we divided the weight co- 
efficient of each event by the exposure time taken from this 
exposure map. 

3 - Scientific products (spectra, images, profiles) in count rates 
can be readily extracted from the merged events list by binning 
the events weighted by these new weight coefficients. As these 
products are corrected for vignetting, we can then used the on- 
axis response for further physical analysis. 
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